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[57] ABSTRACT 
Compounds having the structure: 
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wherein B represents a purine, 7-deazapurine, or pyrirni- 
dine moiety covalently bonded to the exposition of 
the sugar moiety, provided that when B is purine or 
7-deazapurine, it is attached at the Imposition of the 
purine or 7-deazapurine and when B is pyrimidine, it is 
attached at the ^-position; 

wherein A represents a moiety consisting of at least three 
carbon atoms which is capable of forming a detectable 
complex with a polypeptide when the compound is 
incorporated into a double-stranded ribonucleic acid, 
deoxyribonucleic acid duplex, or DNA-RNA hybrid; 

wherein the dotted line represents a chemical linkage 
joining B and A, provided that if B is purine, the 
linkage is attached to the 8-position of the purine, if B 
is 7-deazapurine, the linkage is attached to the 7-posi- 
tion of the deazapurine, and if B is pyrimidine, the 
linkage is attached to the 5-position of the pyrimidine; 
and 

wherein each of x, v and z represents 
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either directly, or when incorporated into oligo- and poly- 
nucleotides, provide probes which are widely useful. 

Applications include detection and localization of poly- 
nucleotide sequences in chromosomes, fixed cells, tissue 
sections, and cell extracts. Specific applications include 
chromosomal karyotyping, clinical diagnosis of nucleic 
arid<ontaimng etiological agents, e.g. bacteria, viruses, or 
fungi, and diagnosis f genetic disorders. 

19 Claims, N Drawings 
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MODIFIED NUCLEOTIDES AND 
POLYNUCLEOTIDES AND COMPLEXES 
FORM THEREFROM 

This invention was made with government support under 5 
grant numbers P50 GM 20124, 132 GM 07499 and T32 CA 
09159 awarded by the National Institutes of Health of the 
Department of Health and Human Services. The Govern- 
ment has certain rights in the invention. This is continuation 1Q 
of application Sen No. 07/130,097, filed Dec. 7, 1987, now 
abandoned, which is in turn a division of application Sen 
No. 06/496,915, filed May 23, 1983, issued Dec 8, 1987 as 
U.S. Pat No. 4,711,955, which is in turn a continuation of 
application Ser. No. 06/255,223, filed Apr. 17, 1981, now 15 
abandoned. 



BACKGROUND OF THE INVENTION 
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Many procedures employed in biomedical research and 
recombinant DNA technology rely heavily on the use of 
nucleotide of polynucleotide derivatives radioactively 
labeled with isotopes of hydrogen ( 3 H), phosphorous f 3 ^), 
carbon ( 14 Q, or iodine (™I). Such radioactive compounds ^ 
provide useful indicator probes that permit the user to detect, 
monitor, localize, or isolate nucleic acid; and other mol- 
ecules of scientific or clinical interest, even when present in 
only extremely small amounts. To date, radioactive materi- 
als have provided the most sensitive, and in many cases the 3Q 
only, means to perform many important experimental or 
analytical tests. There are, however, serious limitations and 
drawbacks associated with the use of radioactive com- 
pounds. First, since personnel who handle radioactive mate- 
rial can be exposed to potentially hazardous levels of 35 
radiation, elaborate safety precautions must be maintained 
during the preparation, utilization, and disposal of the radio- 
isotopes. Secondly, radioactive nucleotides are extremely 
expensive to purchase and use, in large part due to the cost 

f equipment and manpower necessary to provide the appro- ^ 
priate safeguards, producer/user health monitoring services, 
and waste-disposal programs. Thirdly, radioactive materials 
are often very unstable and have a limited shelf-life, which 
further increases usage costs. This instability results from 
radiolytic decomposition, due to the destructive effects asso- 45 
dated with the decay of the radioisotope itself, and from the 
fact that many isotopes e.g. 32 p and 2& I) have half-lives of 
only a few days. 

It is known that haptens can combine with antibodies, but 
can initiate an immune response only if bound to a carrier. 50 
This property can be exploited in detection and identification 



It is also known that biotin and iminobiotin strongly 
interact with avidin, a 68,000 dalton glycoprotein from egg 
white. This interaction exhibits one of the tightest, non- 55 
covalent binding constants (1(^=1 0~ 15 ) seen in nature. If 
avidin is coupled to potentially demonstrable indicator mol- 
ecules, including fluorescent dyes, e.g. fluorescein or 
rhodamine; electron-dense reagents, e.g. ferritin, hemocya- 
nin, or colloidal gold; or enzymes capable of depositing 60 
insoluble reaction products, e.g. peroxidase or alkaline phos- 
phatase, the presence, location, or quantity of a biotin probe 
can be established. Although iminobiotin binds avidin less 
tightly than biotin, similar reactions can be used for its 
detection. Moreover, the reversibility of the iminobiotin- 65 
avidin interaction, by decreasing solution pH, offers signifi- 
cant advantages in certain applications. 



The specificity and tenacity of the biotin-avidin complex 
has been used in recent years to develop methods for 
visually localizing specific proteins, lipids, or carbohydrates 
on or within cells (reviewed by E. A. Bayer and M. Wilchek 
in Methods of Biochemical Analysis, 26, 1, 1980). Chro- 
mosomal location of RNA has been determined by electron 
microscopy using a biotinized protein, cytochrome C, 
chemically crosslinked to RNA as a hybridization probe. 
The site of hybridization was visualized through the binding 
of avidin-ferritin or avidin-methacrylate spheres mediated 
by the avidin-biotin interaction. (J. E. Manning, N. D. 
Hershey, X R. Broker, M. Pellegrini, H. K. Mitchell, and N. 
Davidson, Chromosoma, 53, 107, 1975; J. E. Manning, M. 
PeDegrini, and N. Davidson, Biochemistry, 61, 1364, 1977; 
X R. Broker, L. M. Angerer, P. H. Yen, N. D . Hen ey, and N. 
Davidson, Nucleic Acid Res., 5, 363, 1978; A Sodja and N. 
Davidson, Nucleic Acid Res., 5, 383, 1978.) This approach 
to the detection of polynucleotide sequences, although suc- 
cessful in the specialized cases examined which were highly 
reitterated sequences, is not of general utility for analysis of 
polynucleotides present in single or low copy number. 

Moreover, methods for attanhipg chemical moieties to 
pyrimidine and purine rings are known. Several years ago a 
simple and rapid acetoxymercuration reaction was devel- 
oped for introducing covalently bound mercury atoms into 
the 5-position of the pyrimidine ring, the C-8 position of the 
purine ring or the C-7 position of a 7-deazapurine ring, both 
in nucleotides and polynucleotides. (R. M. K. Dale, D. C. 
Livingston and D. C Ward, Proc. Natl. Acad. Sci. U.S.A., 
70, 2238, 1973; R. M. K. Dale, E. Martin, D. C Livingston 
and D. C. Ward, Biochemistry, 14, 2447, 1975.) It was also 
shown several years ago that organomercurial compounds 
would react with olefinic compounds in the presence of 
palladium catalysts to form carbon-carbon bonds (R. F. 
Heck, J. Am. Chem. Soc., 90, 5518, 1968; R .F. Heck, Ibid., 
90, 5526, 1968; R. F. Heck, Ibid., 90, 5531, 1968; R. F. 
Heck, Ibid, 90, 5535, 1968; and R. F. Heck, J. Am. Chem. 
Soc. 91, 6707, 1969.) Bergstrom and associates (J. L. Ruth 
and D. E Berstrom, J. Org. Chem., 43, 2870, 1978; and D. 
E. Bergstrom and M. K. Ogawa, J. Am. Chem. Soc., 100, 
8106, 1978) and Bigge, et d. (C F. Bigge, P. Kdaritis, J. R. 
Deck and M. P. Mertes, J. Am. Chem. Soc, 102, 2033, 1980) 
have recently applied this reaction scheme in the synthesis 
of C-5 substituted pyrimidine nucleotide compounds. 

Finally, it is known that antibodies specific for modified 
nucleotides can be prepared and used for isolating and 
characterizing specific constituents of the modified nucle- 
otides. (T. W. Munns and M. K. Liszewski, Progress in 
Nucleic Add Research and Molecular Biology, 24, 109, 
1980.) However, none of the antibodies prepared to date 
against naturally occurring nucleotides have been shown to 
react with their nucleotide determinant when it exists in a 
double-stranded RNA or DNA duplex or when in DNA- 
RNA hybrid molecules. 

To circumvent the limitations of radioactively labeled 
probes or previously utilized chemical and biological 
probes, a series of novel nucleotide derivatives that contain 
biotin, irninobiotin, lipoic acid, and other determinants 
attached covalently to the pyrimidine or purine ring have 
been synthesized. These nucleotide derivatives, as well as 
polynucleotides and coenzymes that contain them, will 
interact specifically and uniquely with proteins such as 
avidin or antibodies. The interaction between modified 
nucleotides and specific proteins can be utilized as an 
alternative to radioisotopes for the detection and localization 
of nucleic acid components in many of the procedures 
currently used in biomedical and recombinant-DNA tech- 
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nologies. Methods employing these modified nucleotide- 
protein interactions have detection capacities equal to or 
greater than procedures which utilize radioisotopes and they 
often can be performed more rapidly and with greater 
resolving power. 

These new nucleotide derivatives can be prepared rela- 
tively inexpensively by chemical procedures which have 
been developed and standarized as discussed more fully 
hereinafter. More significantly, since neither the nucleotide 
probes of this invention nor the protein reagents employed 
with them are radioactive, the compounds can be prepared, 
utilized, and disposed of, without the elaborate safety pro- 
cedures required for radioisotopic protocols. Moreover, 
these nucleotide derivatives are chemically stable and can be 
expected to have functional shelf-lives of several years or 15 
more. Finally, these compounds permit the development of 
safer, more economical, more rapid, and more reproducible 
research and diagnostic procedures. 



10 



widely useful as probes in biomedical research and 
recombinant DNA technology. 

Particularly useful are compounds encompassed within 
tins structure which additionally have ne or more of the 
following characteristics: A is non-aromatic; A is at least C 5 ; 
the chemical linkage joining B and A includes an a-olefimc 
bond; A is biotin or iminobiotin; and B is a pyrimidine or 
7-deazapurine. 

These compounds may be prepared by a process which 
involves: 

(a) reacting a compound having the structure: 



x-CH2 0 B 



SUMMARY OF THE INVENTION 
Compounds having the structure: 



x-CH2 0 B — A 
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y z 



y * 
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with a mercuric salt in a suitable solvent under suitable 
conditions so as to form a mercurated compound having the 
structure: 

x-Ofe n B-Hg* 



30 



wherein B represents a purine, deazapurine, or pyrimidine 
moiety covalently bonded to the C 1 -position of the 
sugar moiety, provided that when B is purine or 7-dea- 
zapurine, it is attached at the NP-position of the purine 
or deazapurine, and when Bis pyrimidine, it is attached 35 
at the N -position; 

wherein A represents a moiety consisting of at least three 
carbon atoms which is capable of forming a detectable 
complex with a polypeptide when the compound is 
incorporated into a double-stranded ribonucleic acid, 
deoxyribonucleic acid duplex, or DNA-RNA hybrid; 

wherein the dotted line represents a chemical linkage 
joining B and A, provided that if B is purine the linkage 
is attached to the 8-position of the purine, if B is 
7-deazapurine, the linkage is attached to the 7-position 
of the deazapurine, and if B is pyrimidine, the linkage 
is attached to the 5-position of die pyrimidine; and 

wherein each of x, y, and z represents 

so 



55 



40 



45 



y * 

(b) reacting said mercurated compound, with a chemical 
moiety reactive with the -Hg + portion of said mercu- 
rated compound and represented by the formula . . . N, 
said reaction being carried out in an aqueous solvent 
and in the presence of K2PdCl 4 under suitable condi- 
tions so as to form a compound having the structure: 
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wherein N is a reactive terminal functional group or is A; 
and 

(c) recovering said compound as said modified nucleotide 
when N is A, or when N is a reactive terminal group, 
reacting said compound with a compound having the 
structure M-A, wherein M represents a functional 
group reactive with N in an aqueous solvent under 
suitable conditions so as to form said modified nucle- 
otide, which is then recovered. 

This invention also provides compounds having the struc- 
ture: 
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wherein each of B, B\ and B" represents ,a purine, 30 
7-deazapurine, or pyrimidine moiety covalently 
bonded to the C l -position of the sugar moiety, pro- 
vided that whenever B, B 1 , or B" is purine or 7-dea- 
zapurine, it is attached at the Imposition of the purine 
or 7-deazapurine, and whenever B, B\ or B" is pyri- 35 
midine, it is attached at the N 1 -position; 

wherein A represents a moiety consisting of at least three 
carbon atoms which is capable of forming a detectable 
complex with a polypeptide when the compound is' 
incorporated into a double-stranded duplex formed 40 
with a complementary ribonucleic or deoxyribonucleic 
acid molecule. 

wherein the dotted line represents a chemical linkage 
joining B and A, provided that if B is purine the linkage 
is attached to the 8-position of the purine, if B is 
7-deazapurine, the linkage is attached to the 7-position 
of the deazapurine, and if B is pyrimidine, the linkage 
is attached to the 5-position of the pyrimidine; 

wherein z represents H— or HQ—; and 

wherein m and n represent integers from 0 up to about 
100,000. 

These compounds can be prepared by enzymatic poly- 
merization of a mixture of nucleotides which include the 
modified nucleotides of this invention. Alternatively, nude- 55 
otides present in oligo- ox polynucleotides may be modified 
using chemical methods. 

Nucleotides modified in accordance with the practices of 
this invention and oligo- and polynucleotides into which the 
modified nucleotides have been incorporated may be used as 60 
probes in biomedical research, clinical diagnosis, and 
recombinant DNA technology. These various utilities are 
based upon the ability of the molecules to form stable 
complexes with polypeptides which in turn can be detected, 
either by means of properties inherent in the polypeptide or 65 
by means of detectable moieties which are attached to, or 
which interact with, the polypeptide. 



45 



50 



Some uses include detecting and identifying nucleic acid- 
containing etiological agents, e.g. bacteria and viruses; 
screening bacteria for antibiotic resistance; diagnosing 
genetic disorders, e.g. thalassemia and sickle cell an&mia; 
chromosomal karyotyping; and identifying tumor cells. 

DETAILED DESCRIPTION OF THE 
INVENTION 

Several essential criteria must be satisfied in order for a 
modified nucleotide to be generally suitable as a substitute 
for a radioactively-labeled form of a naturally occurring 
nucleotide. First, the modified compound must contain a 
substituent or probe that is unique, Le., not normally found 
associated with nucleotides or polynucleotides. Second, the 
probe must react specifically with chemical or biological 
reagents to provide a sensitive detection system. Third, the 
analogs must be relatively efficient substrates for commonly 
studied nucleic acid enzymes, since numerous practical 
applications require that the analog be enzymatically 
metabolized, e.g M the analogs must function as substrates for 
nucleic acid polymerases. For this purpose, probe moieties 
should not be placed on ring positions that stoically, or 
otherwise, interfere with the normal Watson— Crick hydro- 
gen bonding potential of the bases. Otherwise, the substitu- 
ents will yield compounds that are inactive as polymerase 
substrates. Substitution at ring positions that alter the normal 
"and" nucleoside conformation also must be avoided since 
such amfonnational changes usually render nucleotide 
derivatives unacceptable as polymerase substrates. Nor- 
mally, such considerations limit substitution positions to the 
5-position of a pyrimidine and the 7-position of a purine or 
a 7-deazapurine. 

Fourth, the detection system should be capable f inter- 
acting with probe substituents incorporated into both single- 
stranded and double-stranded polynucleotides in order to be 
compatible with nucleic acid hybridization methodologies, 
lb satisfy this criterion, it is preferable that the probe moiety 
be attached to the purine or pyrimidine through a chemical 
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linkage or 'linker arm" so that it can readily interact with 
antibodies, other detector proteins, or chemical reagents. 

Fifth, the physical and biochemical properties of poly- 
nucleotides containing small numbers of probe substituents 
should not be significantly altered so that current procedures 5 
using radioactive hybridization probes need not be exten- 
sively modified. This criterion must be satisfied whether the 
probe is introduced by enzymatic or direct chemical means. 

Finally, the linkage that attaches the probe moiety should 
withstand all experimental conditions to which normal 10 
nucleotides and polynucleotides are routinely subjected, 
e.g., extended hybridization times at elevated temperatures, 
phenol and organic solvent extraction, electrophoresis, etc. 

All of these criteria are satisfied by the modified nucle- 
otides described herein* 15 

These modified nucleotides have the structure: 
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7-deazapurines. Moreover, pyrimidines and 7-deazapurines 
useful in this invention must not be naturally substituted at 
the 5- or 7-positions, respectively. As a result, certain bases 
such as thymine, 5-methylcytosine, and 5-hydroxymethyl- 
cytosine are not useful. Presently preferred bases are 
cytosine, uracil, deazaadenine and deazaguanine. 

A may be any moiety which has at least three carbon 
atoms and is capable of forming a detectable complex with 
a polypeptide when the modified nucleotide is incorporated 
into a double-stranded duplex containing either deoxyribo- 
nucleic or ribonucleic acid. 

A therefore may be any ligand which possesses these 
properties, including haptens which are only immunogenic 
when attached to a suitable carrier, but are capable of 
interracting with appropriate antibodies to produce com- 
plexes. Examples of moieties which are useful include: 



X-CH2 0 B — A 



y * 



20 -r_, 



wherein B represents a purine, 7- deazapurine, or pyri- 
midine moiety covalently bonded to the exposition of 
the sugar moiety, provided that when B is purine or 
7-deazapurine, it is attached at the ^-position of the 
purine or 7-deazapurine, and when B is pyrimidine, it 
is attached at the N'-position; 

wherein A represents a moiety consisting of at least three 
carbon atoms which is capable of forming a detectable 
complex with a polypeptide when the compound is 
incorporated into a double-stranded ribonucleic acid, 
deoxyribonucleic acid duplex, or DNA-RNA hybrid; 

wherein the dotted line represents a linkage group joining 
B and A, provided that if B is purine the linkage is 
attached to the 8-position of the purine, if B is 7-dea- 
zapurine, the linkage is attached to the 7-position of the 40 
deazapurine, and if B is pyrimidine, the linkage is 
attached to the 5-position of the pyrimidine; and 

wherein each of x, y and z represents 
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These compounds are widely useful as probes in biomedi- 55 
cal research and recombinant DNA technology. 

Although in principal all compounds encompassed within 
this structural formula may be prepared and used in accor- 
dance with the practices of this invention, certain of the 
compounds are more readily prepared or used or both, and 60 
therefore are presently preferred. 

Thus, although purines, pyrimidines and 7-deazapurines 
are in principal useful, pyrimidines and 7-deazapurines are 
preferred since purine substitution at the 8-position tends to 
render the nucleotides ineffective as polymerase substrates. 65 
Thus, although modified purines are useful in certain 
respects, they are not as generally useful as pyrimidines and 



HN NH 

Y 

0 



HN NH 

Y 

NH 



~ C— \ / ; — C-CH2-NH— ^ \— NQ2; 




NO2 



J. 



-C— CH2— CH 2 C— O— ; — C— (CHafc— 1 ■; and 

OOO 




OH || 
O 

Of these the preferred A moieties are biotin and iminobiotin. 

Moreover, since aromatic moieties tend to intercalate into 
a base-paired helical structure, it is preferred that the moiety 
A be nonaromatic. Also, since smaller moieties may not 
permit sufficient molecular interaction with polypeptides, it 
is preferred that A be at least Cj so that sufficient interaction 
can occur to permit formation of stable complexes. Biotin 
and iminobiotin satisfy both of these criteria. 

The linkage or group joining moiety A to base B may 
include any of the well known bonds including carbon- 
carbon single bonds, carbon-carbon double bonds, carbon- 
nitrogen single bonds, or carbon-oxygen single bonds. How- 
ever, it is generally preferred that the chemical linkage 
include an olefinic bond at the a-position relative to B. Hie 
presence of such an oerolefinic bond serves to hold the 
moiety A away from the base when the base is paired with 
another in the well known double-helix configuration. This 
permits interaction with polypeptide to occur more readily, 
thereby facilitating complex formation. Moreover, single 
bonds with greater rotational freedom may n t always h Id 
the moiety sufficiently apart from the helix to permit rec- 
ognition by and complex formation with polypeptide. 
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It is even more preferred that the chemical linkage group 
be derived from a primary amine, and have the structure 
— CHa— NH— , since such linkages are easily formed uti- 
lizing any of the well known amine modification reactions. 
Examples of preferred linkages derived from allylamine and 
aUyl-(3-amino-2-hydroxy-l-propyl) ether groups have the 
formulae 

— CH=CH— CH2"~NH— and 

— CH=:CH— Ofe— O— Ofe— CH— CHj— NH— , 
OH 



20 



respectively. 

Although these linkages are preferred, others can be used, 
including particularly olefin linkage arms with other modi- 
fiable functionalities such as thiol, carboxylic acid, and 
epoxide functionalities. 

The linkage groups are attached at specific positions, 
namely, the 5-position of a pyrirnidine, the 8-position of a 
purine, or the 7-position of a deazapurine. As indicated 
previously, substitution at the 8-position of a purine does not 
produce a modified nucleotide which is useful in all the 25 
methods discussed herein. It may be that the 7-position of a 
purine, which is occupied by a nitrogen atom, could be the 
point of linkage attachment However, the chemical substi- 
tution methods employed to date and discussed herein are 
n t suitable for this purpose. 

The letters x, y, and z represent groups attached to the 5', 
3', and 2' positions of the sugar moiety. They may be any of 
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Although conceivable, it is unlikely that all of x, y, and z 
will simultaneously be the same. More likely at least one of 
x, y, and z will be a phosphate-containing group, either 
mono-, di-, or tri-phosphate and at least one will be HO — 
or H — . As will be readily appreciated, the most likely 
identity of z will be HO — or H — indicating ribonucleotide 
or deoxyribonucleotide, respectively. Examples of such 
nucleotides include 5*-ribonucleoside monophosphates, 
5-Hribonucleoside diphosphates, 5-ribonucleoside triphos- 
phates, S'-deoxyribonucleoside monophosphates, 5*-deox- 
yribonucleoside diphosphates, S'-deoxyribonucleoside 55 
triphosphates, 5'p-ribonucleoside-3'p t and 5*p-deoxyribo- 
nucleoside-3'p. More specific examples include modified 
nucleotides of this type in which A is biotin or iminobiotin, 
the chemical linkage is 

60 

- O^CH-CHa— NH- or 
-CH=CH-CH2— O— Ofe— CH— Ofe— NH— , 

OH 65 

and B is uracil or cytosine. 
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The general synthetic approach adopted for introducing 
the linker arm and probe moiety onto the base is discussed 
hereinabove. (See especially, J. L. Ruth and D. E. Berg- 
strom, J. Org. Chera, 4.3, 2870, 1978; D. E. Bergstrom and 
M. K. Ogawa, J. Amer. Chem. Soc. 100, 8106, 1978; and C. 
F. Bigge, P. Kalaritis, J. R. Deck and M. R Mertes, J. Amer. 
Chem. Soc. 102, 2033, 1980.) However, the olefin substitu- 
ents employed herein have not been used previously. To 
facilitate attachment of probe moiety A, it has been found 
particularly desirable to employ olefins with primary amine 
functional groups, such as allylamine [AA] or allyl-(3- 
amino-2-hydroxy-l-propyl) ether [NAGE], which permit 
probe attachment by standard amine modification reactions, 
such as, 



NHz 
II 

-CHaNIfc+R— C— OR- 
Im idnte 

O 
II 

R-C 



NH2 
^ II 
-> — CH2NHCR 



O ^ -CHiNHCR 



-CH2NH2+R-C 

o 



30 




-OI2NH2 + 



NHS-ester (N-hydroxysucrimmide) 
S 

-OI2NH2 + R-N=C=S > — CH2NHCNHR 

Isothiocyazmte 

O 

— QfcNI^+ / X , - n .lt- 



OH 

-> -CHaNHOIiCHR 



Epoxide 



Because of ease of preparation it has been found preferable 
to use NHS-esters for probe addition. However, olefin linker 
arms with other modifiable functional groups, such as thiols, 
carboxylic acids, epoxides, and the like, can also be 
employed. Furthermore, both linker arm and probe can be 
added in a single-step if deemed desirable. 
Specifically, modified nucleotides having the structure: 



x-Ofc ft B — A 



y * 



wherein B represents a purine, 7-deazapurine, or pyrirni- 
dine moiety covalently bonded to the exposition of 
the sugar moiety, provided that when B is purine or 
7-deazapurine, it is attached at the NP-position of the 
purine or deazapurine, and when B is pyrirnidine, it is 
attached at the ^-position; 
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wherein A represents a moiety consisting of at least three 
carbon atoms which is capable of forming a detectable 
complex with a polypeptide when the compound is 
incorporated into a double-stranded ribonucleic acid, 
deoxyribonucleic acid duplex, DNA-RNA hybrid; 

wherein the dotted line represents a chemical linkage 
joining B and A, provided that if B is purine, the 
linkage is attached to the 8-position of the purine, if 
7-deazapurine, the linkage is attached to the 7-position 
of the deazapurine, and if B is pyrimidine, the linkage 
is attached to the 5-position of the pyrimidine; and 

wherein each of x, y, and z represents 
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can be prepared by: 
(a) reacting a compound having the structure: 

*-CH 2rt B 



y * . 



is 



20 



25 



30 



with a mercuric salt in a suitable solvent under suitable 
conditions so as to form a mercurated compound having the 35 
structure: 



X-CH2 n B-Hg+ 




(b) reacting said mercurated compound with a chemical 45 
moiety reactive with the — Hg + portion, of said mer- 
curated compound and represented by the formula . . . 
N, said reaction being carried out in an aqueous solvent 
and in the presence of KjPdC^ under suitable condi- 
tions so as to form a compound having the structure: 50 



x-CHj 0 B — N 



y * 



55 



wheremN is a inactive terminal functional group or is A; 
and 50 

(c) recovering said compound as said modified nucleotide 
when N is A, or when N is a reactive terminal group, 
reacting said compound with a compound having the 
structure M-A, wherein M represents a functional 
group reactive with N in an aqueous solvent under 65 
suitable conditions, so as to form said modified nucle- 
otide, which is then recovered. 
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The following schema is illustrative: 
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40 
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Although the reactions can be carried out at hydrogen ion 
concentrations as low as pH 1, or as high as pH 14, it is 
preferred to operate in the range from about 4 to 8. This is 
especially true when dealing with unstable compounds such 
as nucleoside polyphosphates, polynucleotides, and nucle- 
otide coenzymes which are hydrolyzed at pH's outside this 
range. Similarly, it is preferred to operate at a temperature in 
the range from about 20° C. to 30° C. to avoid possible 
decomposition of labile organic substrates. H wever, the 
reactions can be carried out at temperatures from about 5° C. 
to 100° C. As is usual with chemical reactions, higher 
temperatures promote the reaction rate and lower tempera- 
tures retard it Thus, in the temperature range from 5° C. to 
100° C, the ptimum reaction time may vary from about 10 
minutes to 98 hours. In the preferred temperature range, 
reaction times normally vary from about 3 to 24 hours. 
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The preferred procedure for m aintainin g the pH in the 
desired range is through the use of buffers. A variety of 
buffers can be employed These include, for example, 
sodium or potassium acetate, sodium or potassium citrate, 
potassium citrate-phosphate, tris-acetate and borate-sodium 5 
hydroxide buffers. The concentration of buffer, when 
employed, can vary over a wide range, up to about 2.0 molar. 

While a particular advantage of the mercuration and 
palladium catalyzed addition reactions is that they can be 
carried out in water, small amounts of an organic solvent can 
be usefully included as a solubility aid The organic solvents 
usually chosen are those which are miscible with water. 
These may be selected from ethers, alcohols, esters, ketones, 
amides, and the like such as methanol, ethanoU propanol, 
glycerin, dioxane, acetone, pyridine and dimethylforma- 
mide. However, since it has been observed that the presence 
of alcohols, such as methanol, often results in alkoxy- 
addition across the olefin double bond, any organic solvent 
used as a solubility aid should be chosen carefully. Intro- 
duction of alkoxy substituents to the a - or ^-exocyclic 
carbon atoms often results in the production of compounds 
which are utilized much less efficiently as enzyme sub- 
strates. 

Although various mercuric salts may be utilized, the 
presently preferred salt is mercuric acetate. Also, as indi- ^ 
cated previously, the compounds may be prepared by first 
adding a linker arm and then the moiety A, or by adding a 
linker arm to which Ais already attached Thus, the chemical 
moiety represented by the formula . . . N may be any one of 
the numerous entities which ultimately result in production 3Q 
of the desired compounds. 

Examples include 



20 



-01=01—012— NH* 

— ch=oi-- ok-o— ofe— ch— ah— nh* 
oh 

-CH=CH-CH a -NH-Wotin ( and 

-CH=CH2— Ok— 0— Ofa— CH— CH2— ■ NH-iminobiotin. 
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The amounts of the reactants employed in these reactions 
may vary widely. However, in general the amounts of 45 
urmiercurated compound, mercurated compound, and palla- 
(tium-ccfttaiiiing compound will be substantially stoichio- 
metric whereas the mercuric salt and compound . . . N will 
be present in molar excess, e.g. 5-20 moles of ... N or of 
mercuric salt per mole of mercurated compound or urnner- 50 
curated compound, respectively. In practice, amounts will 
vary depending upon variations in reaction conditions and 
the precise identity of the reactants. 

Having the biotin probe directly attached to nucleotide 
derivatives that are capable of functioning as enzyme sub- 55 
strates offers considerable versatility, both in the experimen- 
tal protocols that can be performed and in the detection 
methods (microscopic and non-microscopic) that can be 
utilized for analysis. For example, biotin nucleotides can be 
introduced into polynucleotides which are in th process of 60 
being synthesized by cells or crude cell extracts, thus 
making it possible to detect and/or isolate nascent (growing) 
polynucleotide chains. Such a procedure is impossible to d 
by any direct chemical modification method Furthermore, 
enzymes can be used as reagents for introducing probes such 65 
as biotin into highly selective or site-specific locations in 
polynucleotides; the chemical synthesis of similar probe- 



modified products would be extremely difficult to achieve at 
best 

The synthesis of nucleotides containing biotin or irrrino- 
biotin was achieved as detailed in the examples set forth 
hereinafter. Pyrimidine nucleoside triphosphates containing 
either of these probes attached to the C-5 carbon atom were 
good to excellent substrates for a wide variety of purified 
nucleic acid polymerases of both prokaryotic and eukaryotic 
origin. These include DNA polymerase I oiK coli, bacte- 
riophage T4 DNA polymerase, DNA polymerases a and P 
from murine (A-9) and human (HeLa) cells, and the DNA 
polymerase of Herpes simplex virus. Corifirming data were 
obtained with & coli DNA polymerase I using either the 
nick-translation condition of Rigby, et al. (P. W. J. Rigby, M. 
Dieckmann, C. Rhodes and P. Berg, J. Mol. Biol. 113, 237, 
1977) or the gap-filling reaction described by Bourguignon 
et al. (G. J. Bourguignon, P. J. Tkttersall and D. C. Ward, J. 
Virol. 20, 290, 1976). Bio-dUTP has also been found to 
function as a polymerase substrate both in GHO cells, 
permeabilized by treatment with lysoletithin according to 
the method of Miller, et al. (M. R. Miller, J. C Casteliot, Jr. 
and A. B. Pardee, Exp. Cell Res. 120,421, 1979) and in a 
nuclear replication system prepared from Herpes simplex 
infected BHK cells. Although biotinyl ribonucleoside triph- 
osphates were found to function as substrates for the RNA 
polymerases of & coli and bacteriophage T7, they are not 
utilized as efficiently as their deoxyribonucleotide triphos- 
phate counterparts. Indeed, they are incorporated poorly, if 
at all, by the eukaryotic RNA polymerases examined (HeLa 
cell RNA polymerase m, calf thymus RNA polymerase II 
and mouse cell RNA polymerase II). While this limited 
range of substrate function does restrict the utility in some 
in vivo or in vitro transcription studies, biotinlabeled RNA 
probes can be prepared enzymatically from DNA templates 
using & coli or T7 RNA polymerases or by 3' end-labeling 
methods using RNA ligase with compounds such as bioti- 
nyl-pCp. The AA- and NAGE-derivatives of UTP are, 
however, substrates for the eukaryotic RNA polymerases 
mentioned above. With the availability of antibodies to these 
analogs, the isolation of nascent transcripts by immunologi- 
cal or affinity procedures should be feasible. 

The enzymatic polymerization of nucleotides containing 
biotin or irninobiotin substituents was not monitored 
directly, since neither of these probes were radiolabeled 
However, two lines of experimental evidence clearly show 
that the biotinyl-nucleotides were incorporated The first is 
that polynucleotides synthesized in the presence of biotin- 
nucleotides are selectively retained when chromatography 
over avidin or streptavidin affinity columns. (Tables I and II) 
For example, whereas normal DNA, nick translated with 
32 P-dAMP, is quantitatively eluted upon the addition of 
0.5M NaCl, the vast majority of biotinyl-DNA or iminobi- 
otinyl-DNA remains bound to the resin even after extensive 
washing with high salt, urea, quanidine-HQ, formamide or 
50 mM NaOH. The small fraction of the radiolabel eluted by 
these washing conditions is not retained when applied to the 
resin a second time, suggesting that radioactivity is associ- 
ated with DNA fragments which are free of biotin substitu- 
tion. The second line of evidence is that only biotin-labeled 
polynucleotides are immunoprecipitated when treated with 
purified anti-biotin IgG f llowed by formalin-fixed Staphy- 
lococcus aureus, (Tfcble EI) It is clear from the data in these 
tables that extremely small amounts of biotin can be 
detected by this method These results also show that the 
biotin molecule can be recognized by avidin, streptavidin or 
specific antibodies while the DNA is still in its native, 
double-stranded form, a condition that is absolutely essential 
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if the antibody-binding or avidin-affinity approaches are to 
be useful in probe detection employing hybridization tech- 
niques. 

TABLE I 

SELECTIVE RETENTION OF BIOTINIZED DNA 
ON AVIDIN-SEPHAROSE 



Ehient 




% DNA Retain 


ed on Resin 


Bio-DNA (1%) 


T-DNA 


Load - 


3x 10 s cpm 
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TABLE n 



Affinity Chromatography of InrinobiotiiMlUTP 
and IminoMotinized - PNA on Streptavidin-Sephaross 



% Retained on SA-Sepharose 



10 



15 



20 



25 



Ehient 




T-DNA 


3 H-IB-dUTP 


IB-DNA 


Load- 


10 mM TrivHQ, 
8.3 50 mM NaQ 


8.7 


100 


99.7 


(1) 


0.1 M NaQ 


<0.1 


100. 


99.7 


(2) 


1.0 M NaQ 


<0.01 


100 


99.4 


(3) 


8MUrea 


<0.01 


97.5 


98.5 


(4) 


6 M goanidine-HQ 


<0.01 


97.0 


97.0 


(5) 


50 mM NHi-acetate, 
pH4.0 


«0.01 


<0.01 


96.5 


(6) 


50 mM NHt-acctate, 

pH4.0 

2 mM biotin 


<0.01 


<0.01 


<0.01 



TABLE m 



SELECTIVE IMMUNOPRECIPITATION OF BIO-DNA 
WITH ANn-BIOTIN IgG and STAPH AUREUS 



CPM in 



CPM in 



DNA* 


Antibody 




Snpexnatant 


T-DNA 




70 


4867 


T-DNA 


Anti-Bio IgO 


87 


5197 


T-DNA 


Non-immune IgG 


55 


5107 


Bio-DNA 




53 


3886 


Bio-DNA 


Anti-Bio IgO 


3347 


736 


Bio-DNA 


Non-imimme IgO 


60 


3900 



*N.T. pBR-322 DNA, 32 p-labeled; 1% Biotin ^institution. Specific activity, 
2 x id 7 cpm/ug Biotin detection 0.001-0.01 pmolea. 

Thus, it is possible to prepare novel compounds having 

the structure: 
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wherein each of B, B' and B", represents a purine, 
deazapurine, or pyrimidine moiety covalently bonded 
to the C 1 -position of the sugar moiety, provided mat 
whenever B, B\ or B M is purine or 7-deazapurine, it is 
attached at the N*-p siti n of the purine or deazapu- 
rine, and whenever B, B', or B" is pyrimidine, it is 
attached at the N 1 -position; 
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wherein A represents a moiety consisting of at least three 
carbon atoms which is capable of forming a detectable 
complex with a polypeptide when the compound is 
incorporated into a double-stranded duplex formed 
with a complementary ribonucleic or deoxyribonucleic 5 
acid molecule, 
wherein the dotted line represents a linkage group joining 
B and A, provided that if B is purine, the linkage is 
attached to the 8-position of the purine, if B is 7-dea- 
zapurine, the linkage is attached tothe7-positionof the 10 
deazapurine, and if B is pyrirnidine, the linkage is 
attached to the 5-position of the pyrirnidine; 
wherein z represents H— or HO—; and 
wherein m and n represent integers from 0 up to about 

100,000. 15 
Of course, it should be readily understood that in general 
m and n will not simultaneously be 0 since, in that event, die 
compound becomes merely a modified nucleotide as 
described previously. In general B' and B" will vary within 
the same oligo- or polynucleotide, being alternatively uracil, ^ 
cytosine, thymine, guanine, adenine, or the like. Also, in 
general, the variation will correspond to the ordered 
sequence of nucleotides which codes for the synthesis of 
peptides according to the well known Genetic Code. How- 
ever, it is intended that the structure shown also embrace 
polynucleotides such as poly C, poly U, poly r(A-U), and 25 
poly d(A-U) as well as calf thymus DNA, ribosomal RNA 
of K coli or yeast, bacteriophage RNA and DNA (R17, fd), 
animal viruses (SV40 DNA), chromosomal DNA, and the 
like, provided only that the* polynucleotides be modified in 
accordance with this invention. 30 

It is also to be understood that the structure embraces 
more than one modified nucleotide present in the oligomer 
or polymer, for example, from two to thirty modified nucle- 
otides. Hie critical factor in this regard is that the number of 
modifications not be so great that the polynucleotide is 35 
rendered ineffective for the intended use, 

Finally, it should be understood that modified oligo- and 
polynucleotides can be joined to form larger entities having 
the same structure so long as terminal groups are rendered 
compatible or reactive. 40 

These compounds can be made by enzymatic polymer- 
ization of appropriate nucleotides, especially nucleotide 
triphosphates in the presence of a nucleic acid template 
which directs synthesis under suitable conditions. Such 
conditions can vary widely depending upon the enzyme 45 
mployed, amounts of nucleotides present, and other vari- 
ables. Illustrative enzymes include DNA polymerase I of R 
coli, bacteriophage T4 DNA polymerase, DNA polymerases 
<x and 0 from murine and human (HeLa) cells, DNA 
polymerase from Herpes simplex virus, RNA polymerase of 50 
& colU RNA polymerase of bacteriophage T7, eukaryotic 
RNA polymerase including HeLa cell RNA polymerase m, 
calf thymus RNA polymerase 21, and mouse cell RNA 
polymerase II. 

Also, the compounds can be prepared by terminal addi- 55 
tion to oligo- or polynucleotides to produce compounds in 
which m or n is 0 depending upon whether the addition is at 
the 5' or 3' position. Moreover, the compounds such as pCp 
or pUp in which the base is biotinized can be added to 
existing molecules employing the enzyme RNA ligase. 60 

Modified oligo- and polynucleotides can also be prepared 
by chemical modification of existing oligo- or polynucle- 
otides using the approach described previously for modifi- 
cation of individual nucleotides. 

The various modified nucleotides, oligonucleotides, and 65 
polynucleotides of this invention may be detected by con- 
tacting the compounds with polypeptides which are capable 



of forming complexes therewith under suitable conditions so 
as to form the complexes, provided that the polypeptides 
include one or more moieties which can be detected when 
the complex or complexes is or are formed, generally by 
means of conventional detection techniques. 

One polypeptide detector for the biotinyl-type probe is 
avidin. The avidin-biotin interaction exhibits one of the 
tightest non-covalent binding constants (K d£j =10 =15 ) seen in 
nature. If avidin is coupled to potentially demonstrable 
indicator molecules, e.g., fluorescent dyes (fluorescein, 
rhodamine), electron-dense reagents (ferritin, hemocyanin, 
colloidal gold), or enzymes capable of depositing insoluble 
reaction products (peroxidase, alkaline phosphatase) the 
presence, location and/or quantity of the biotin probe can be 
established. 

Avidin has, unfortunately, one property that makes it less 
desirable as a biotm-indicator protein when used in con- 
junction with nucleic acids or chromatin material. It has 
been reported (M. H. Heggeness, Stain lechnoL, 52, 165, 
1977; M. H. Heggeness and J. F. Ash, J. Cell. Biol., 73, 783, 
1977; E. A. Bayer and M. Wilchek, Methods of Biochemical 
Analysis 26, 1, 1980) that avidin binds tightly to condensed 
chromatin or to subcellular tractions that contain large 
amounts of nucleic acid in a manner which is independent of 
its biotin-binding property. Since avidin is a basic glyco- 
protein with a pi of 10.5, its histone-lDce character or its 
carbohydrate moieties are most likely responsible for these 
observed non-specific interactions. 

A preferred probe for biotin-containing nucleotides and 
derivatives is streptavidin, an avidin-like protein synthesized 
by the soil organism Streptomyces avidiniL Its preparation 
and purification is described in Hoflman, et al., Proc. Natl. 
Acad. Sci., 77, 4666 (1980). Streptavidin has a much lower 
pi (5.0), is non-glycosylated, and shows much lower non- 
specific binding to DNA than avidin, and therefore offers 
potential advantages in applications involving nucleic acid 
detection methodology. 

A most preferred protein for biotin-like probe detection is 
monospecific rabbit IgG, anu'biotin immunoglobulin. This 
compound was prepared by immunizing rabbits with bovine 
serum albumin conjugated biotin as described previously 
(M. Berger, Methods in Enzymology, 62, 319[1979]) and 
purified by affinity chromatography. Although the associa- 
tion constant of immunoglobulin-haptens have values of 
(10 6 to 10 10 ) which are considerably lower than for 
avidin-biotin complexes, they are substantially equivalent to 
those observed with the avidin-irninobiotin complex. Fur- 
thermore, the anti-biotin antibodies have proven extremely 
useful in detecting specific polynucleotide sequences on 
chromosomes by in situ hybridization since little, if any, 
non-specific binding of the antibody to chromatin material 
occurs. 

The modified polynucleotides of this invention are 
capable of denaturation and renaturation under conditions 
compatible with their use as hybridization probes. An analy- 
sis of the thermal denaturation profiles and hybridization 
properties of several biotin-substituted DNA and RNA poly- 
mers clearly indicates this. For example, pBR 322 DNA or 
X DNA, nick translated to introduce approximately 10-100 
biotin residues per kilobase, have Tm values essentially 
identical to that of the control, biotin-free DNAs. Further- 
more, ^P-labeled, biotin-substituted, pBR 322 DNA, exhib- 
ited the same degree of specificity and autoradi graphic 
signal intensity as control, thyrmdine-coritairiing DNA, 
when used as a hybridization probe for detecting bacterial 
colonies containing the plasmid. 

In DNA duplexes, such as MVM RF DNA, in which every 
thymidine residue in one strand (1250 in toto per 5 Kb) is 
replaced by a biotmyl-nucleotide, the Tm is only 5° C. less 
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than that of the unsubstituted control. Although the Tm of 
poly d(A-bioU) in which each base pair contains a bio- 
dUMP residue is 15° C. lower than the poly d(A-T) control, 
the degree of cooperativity and the extent of hyperchronric- 
ity observed both during denaturation and ienaturation were 
the same for the two polymers. A parallel analysis of RNA 
duplexes and DNA/RNA hybrids indicates that their Tm's 
also decrease as the biotin-content of the polymer increases. 
However, it is clear that a substantial number of biotinmol- 
ecules can be introduced without significantly altering the 
hybridization characteristics of the polymers. 

These results strongly suggested that biotin-substituted 
polynucleotides could be used as probes for detecting and/or 
localizing specific polynucleotide sequences in chromo- 
somes, fixed cells, or tissue sections. The general protocol 
for detecting the biotin-substituted probe is schematically 
illustrated as follows: 



GENERAL PROTOCOL FOR PROBE DETECTION 
VIA IN SITU, COLONY, OR NORTHERN/SOUTHERN 
HYBRIDIZATION METHODS 

And probcj^gupnro^ 



1) Target 
Delivery 



Hybridize with biotinizedor 
haptexrized probe (with or with- 
out cloning vehicle sequences) 
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2) Signal 
Amplification 




Biotin 



1) Avidin - peroxidase 

2) IgO - peroxidase 

3) Primary cc- 
determinentlgG 



6 6 A A 6 




(2) 



3) Detection: 1) Insoluble peroxidase products : DAB 
2) Antibody sandwiching techniques 
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This general scheme illustrates only procedures used for 
gene mapping (cytogenetics), and recombinant DNA-tech- 
nologies. However, it can be equally well applied to the 
detection of nucleic acid sequences of bacterial, viral, fungal 
or parasite origin in clinical samples and this forms the basis 55 
of a powerful new approach to clinical diagnostics which 
does not rely on the use of radioisotopes. 

Immunological and histochemical methods for the detec- 
tion of bi tin have shown that the basic approach is useable 
for a rapid method ofgene mapping in situ hybridization and 60 
non-radioactive procedures for detecting specific nucleic 
acid sequences by blotting hybridizati n methods. Use may 
be made of this technology in development of new clinical 
diagnostic procedures. 

Using this approach, it is possible to determine the 65 
presence of a specific deoxyribonucleic or ribonucleic acid 
molecule, particularly such a molecule derived from a living 



organism, e.g. bacteria, fungus, virus, yeast, or mammal. 
This in turn permits diagnosis of nucleic arid-containing 
etiological agents in a patient or other subject 

Moreover, it provides a method for screening bacteria to 
determine antibiotic resistance. Thus, far example, penicillin 
resistance in Streptococcus pyogenes or Neisseris meningiti- 
dis; tetracycline resistance in Staphylococcus aureus, Can- 
dida albicans, Pseudomonas aeruginosa, Streptococcus 
pyogenes* or Neisseria gonorrhoeae; and aminoglycoside 
resistance in Mycobacterium tuberculosis can be deter- 
mined. 

In these methods a polynucleotide is prepared which is 
complementary to the nucleic acid sequence which charac- 
terizes the organism or its antibiotic resistance and which 
additionally includes one or more modified nucleotides 
according to this invention. This polynucleotide is hybrid- 
ized with nucleic acid obtained from the organism under 
scruntiny. Failure to hybridize indicates absence of the 
organism or of the resistance characteristic. Hybridized 
nucleic add duplexes are then identified by forming a 
complex between the duplex and a suitable polypeptide 
which carries a detectable moiety, and detecting the presence 
of the complex using an appropriate detection technique. 
Positive detection indicates that the complex, the duplex and 
therefore the nucleic acid sequence of interest are present 

This approach can be extended to the diagnosis of genetic 
disorders, such as thalassemia and sickle cell anemia. Hie 
deoxyribonucleotide acid gene sequence whose presence or 
absence (in the case of thalassemia) is associated with the 
disorder can be detected following hybridization with a 
polynucleotide probe according to this invention based upon 
complex formation with a suitable detectable polypeptide. 

Hie mapping of genes or their transcripts to specific loci 
on chromosomes has been a tedious and time-consuming 
occupation, involving mainly techniques of cell-fusion and 
somatic cell genetics. Although in situ hybridization has 
been employed successfully for mapping single-copy gene 
sequences in species that undergo chromosomes polyteni- 
zation, such as Drosophila, detection of unique sequence 
genes in most higher eukaryotic chromosomes has been 
extremely difficult, if not impossible, using standard 
hybrization methods. The necessity for polynucleotide 
probes of very high specific radioactivity to facilitate auto- 
radiographic localization of the hybridization site also 
results in rapid radiodecomposition of the probe and a 
concomitant increase in the background noise of silver grain 
deposition. The use of hybridization probes with low to 
moderate specific radioactivities requires exposure times of 
many days or weeks, even to detect multi-copy sequences, 
such as ribosomal RNA genes or satellite DNA. Since 
recombinant DNA technology has made feasible the 
molecular cloning of virtually every single-copy sequence 
found in eukaryotic cells, it would be extremely beneficial to 
have a rapid and sensitive method for mapping the chro- 
raosmal origin of such cloned genomic fragments. 

Modified nucleotides may be used in a method of gene 
mapping by in situ hybridization which circumvents the use 
of radioisotopes. This procedure takes advantage of a thy- 
midine analogue containing biotin that can be incorporated 
enzymatically into DNA probes by nick translation. After 
hybridization in situ the biotin molecules serve as antigens 
for affinity purified rabbit anti-biotin antibodies. Immunof- 
luorescent antibody sandwiches made with fluorescein-la- 
.beled goat anti-rabbit IgG allow for rapid and specific 
cytogenetic localization of cloned gene sequences as green- 
yellow bands. This method offers four major advantages 
over conventional autoradiographic methods fin situ gene 
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localization; less background noise, an increase in resolving should greatly facilitate our understanding of the genetic 

power between bands; a decrease in the time required to organization of the chromosome and make clinical cytoge- 

detennine the site of probe hybridization; and chemically netic diagnosis much more rapid and practical, 

stable hybridization probes. This method has been applied While a single-step "antibody sandwich" method in which 

successfully to the localization of rcitterated and unique 5 the chromosome spread is challenged, post-hybridization, 

DNA sequences in the polytene chromosome of Drosophila with rabbit anti-biotin IgG may succeed, this protocol may 

milanogaster and satellite DNA on mouse metaphase chro- not generate sufficient fluorescence for unambiguous gene 

mosomes. assignments. However, a much stronger fluorometric signal 

Thus it has been found that polytene chromosomes could can be achieved by using the "haptene-antibody sandwich 
be used as a test system for establishing the efficacy of 10 technique" described by Lamm, et al., (1972); Wofsy, et al., 
probes using the modified nucleotides according to the (1974). In this procedure the primary antibody, in our case 
instant invention as detected by indirect immunofluores- monospecific, rabbit anti-biotin IgG, is chemically modified 
cence for in situ gene mapping. The probes included a with a haptenization reagent, such as 2,4-dinitrofluoroben- 
variety of cloned Drosophila sequences obtained from Otto zene, preferably while the immunoglobulin is bound to an 
Schmidt and Dieter Soil, such as tRNA genes cloned in 15 antigen affinity column (biotin-Sepharose TM). As many as 
plasmid vectors with inserts of sizes ranging from about 5 to 15-20 haptene (DNP) groups can be coupled to the primary 
about 22 ldlobases. Many of these clones have already been antibody without decreasing its antigen binding affinity or 
assigned to specific bands on the Drosophila chromosome specificity (Wallace and Wofsy, 1979). If the primary anti- 
map by conventional in situ hybridization methods employ- body treatment of the test sample is followed by an incu- 
ing radioisotopes. 20 bation with a fiuorescently labeled anti-hapten IgG antibody, 

DNA probes were nick translated in the presence of rather than a fiuorescently labeled anti-IgG, a 5-7 fold 

Bio-dUTP. Occasionally 3 H dATP and/or 3 H dCIP was increase in fluorescence signal can be achieved. Since one 

included in the nick translation reaction mixture. This also has available monospecific guinea pig anti-DNP IgG, 

allowed both autoradiographic and immunofluorescent we can haptenize this secondary antibody with biotin and 

localization of a sequence on a single chromosome spread 25 thus generate two anti-hapten IgG populations, DNP-labeled 

In situ hybridization was performed as described in M. L. anti-biotin IgG and biotin-labeled anti-DNP IgG. If these 

Pardue, and J. G. Gall, Methods in Cell Biol, 10, 1 (1975). can be used alternately to achieve several rounds of hapten- 

After the final 2xSSC wash to remove unhybridized probe, antibody sandwiching and then followed with fiuorescently 

the slides were rinsed with PBS (phosphate buffered saline) labeled protein A from Staphylococcus aureus, which binds 

and incubated at 37° C. with 2.5 pg/ml Rabbit anti-biotin in 30 specifically to IgG molecules from many mammalian S pe- 

PBS and 10 mg/ml BSA for 2-16 hours. This was followed cies, it could result in an enormous amplification of the 

by incubation of the slides with FTTC labeled Goat anti- primary antibody signal with its concomitant utility. 

Rabbit IgG (Miles Laboratories, diluted 1:100 in PBS and 10 The protein streptavidin from Streptomyces avidini is a 

mg/ml BSA) for one-four hours. Evans Blue was often potential alternative to anti-biotin IgG as a vehicle to spe- 

required as a red counterstain to see the chromosomes with 35 cificaUy direct a coupled visualization system [e.g., fluores- 

fluorescent illumination. cent probes (above) or histochemical reagents (below)] to 

When plasmids pBR 17D and pPW 539 containing 5 Kb the site of the hybridized biotin-containing polynucleotide, 

and 22 Kb inserts, respectively, were hybridized by this One of strcptavidin's advantages over anti-biotin IgG is that 

method, it was found that the pattern of hybridization is its affinity for biotin is K^^-10 15 whereas association 

reproducible from spread to spread and is observed imam- 40 constants for haptene-IgG interactions are 10 7 to 10 10 . Hie 

biguously on greater than 90% of the chromosome spreads fast reaction rate and extreme affinity mean that the time 

on a given slide. required to localize the biotinized probe will be minutes with 

The cloned transposable element pAC 104 is known to streptavidin versus hours with immunologic reagents, 

map at many sites along the Drosophila genome. Compari- Initial evaluations of a streptavidin detection system are 

son of the autoradiograph and the fluorescent picture 45 currently in progress. Polytene chromosomes hybridized 

obtained by in situ hybridization of this probe illustrates a with biotinized DNA probes will be incubated with strepta- 

major advantage of this method, ie., that where diffuse vidin followed by a subsequent incubation with bovine 

regions of silver grains appear on an autoradiograph, dou- serum albumin which has been doubly labeled with biotin 

blets or a series of bands are discernible by immunofluo- and FTTC (FTTC, biotinyl-BS A). Since only one of the four 

rescent labeling. 50 streptavidin submits is likely to be involved in binding at 

The other immediately obvious advantage of this method each biotinized DNA site, potentially one labeled BSA 

is the tremendous decrease in time required for gene assign- molecule can bind to each of the remaining three noncon- 

ments to be made by indirect immunofluorescence. An jugated subunits of the streptavidin-biotinyl nucleotide com- 

assignment of a DNA fragment to a specific band can be plex. The fluorescence signal from this single streptavidin+ 

made within six hours of hybridization. This is in compari- 55 FTTC, biotinyl-BS A layer will be compared with a control 

son to days or weeks required for autoradiographic exposure using the basic "antibody sandwich method" described 

methods. This factor, in combination with increased resolu- earlier. 

tion, makes the use of modified nucleotides detected by If the "antibody sandwich" and streptavidin+FTTC, bioti- 

indirect immunofluorescence immediately preferable to nyl-BSA detecti n intensities are comparable, one can 

more classical methods. 60 attempt to enhance the streptavidin+FTTC, biotinyl-BSA 

It has been shown that this immun logical method also system to single-copy copy sensitivity in a manner that 

works with mammalian chromosomes wherein satellite parallels the multiple "haptene-antibody sandwich" 

DNA has been mapped to the centromeric regions of mouse approach. Since some of biotin groups n BSA will not be 

metaphase chromosomes. Hie result provides a basic foun- bound to the first layer of streptavidin, a second layer of 

dation for the development of a simple gene mapping 65 streptavidin can be added until sufficient signal is obtained, 

procedure for single copy (unique) sequences in chromo- For example, if in the second layer, only two streptavidin 

somes from human and other mammals. Such a procedure protomers bind to each first-layer BSA and each of these 
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streptavidin protomers binds three FTTC-biotinyl BS A mol- 
ecules, then the second layer intensity will be twice as great 
as that from the first layer, for the third layer, with analogous 
binding stoichiometrics, the flu rescent intensity will be 
12-fold that of the first layer, so the total intensity will 5 
rapidly increase With successively added layers. There are 
plans to use a larger carrier protein such as thyroglobulin 
rather than BSA in order to maximize amounts of attached 
fluorescent and biotin probes. It may also be necessary to use 
a longer linker arm between the biotin probe and the carrier 10 
protein. A longer linker arm should sterically optimize the 
theoretical delivery of a biotinized fluorescent carrier mol- 
ecule to each nonconjugated streptavidin subunit and maxi- 
mize the number of streptavidin protomers in the subsequent 
layer which will bind to the biotinized fluorescent carrier. As is 
before, appropriate controls will be done to insure that 
substitution of the carrier protein with fluorescent probes 
and biotin does not cause solubility and/or nonspecific 
binding problems. 

The streptavidin-carrier protein delivery system has two 20 
significant advantages over the immunfluorescent approach 
in addition to its speed of delivery. First, only two protein 
components are needed to form the layers. Second, only the 
carrier protein needs to be modified and it is not necessary 
to maintain functional or even total structural integrity as 25 
long as the biotin groups are accessible to streptavidin. 

An alternative to the fluorescence method for visualizing 
hybridized probes is to direct enzymes such as peroxidase, 
alkaline phosphatase of fi-galactosidase to the hybridization 
site where enzymatic conversion of soluble substrates to 30 
insoluble colored precipitates permits light microscope visu- 
alization. The important advantage of this technique is that 
the histochemical methods are 10 to 100-fold more sensitive 
than fluorescence-detection. In addition, the colored precipi- 
tates do not bleach with extensive light exposure thus 35 
avoiding one of the general disadvantages of fluorescent 
light microscopy. These enzymes can be coupled to the final 
antibody instead of fluorescent probes in the (t haptene- 
anubody sandwich" technique using bifunctional reagents 
such as glutaraldehyde or in the case of peroxidase via 4b 
oxidation of the peroxidase carbohydrate moieties to alde- 
hydes and coupling of these residues with e-amino groups of 
the desired protein. For the streptaWdin-tf otinized carrier 
protein method, an enzyme with biotinyl groups coupled to 
it could replace a fluoiescently-biotinized carrier system. 45 
Alternately, the enzyme could be coupled via biotin to the 
last layer of streptavidin with amplification of streptavidin 
sites being built up in preceding layers using biotinized BSA 
or thyroglobulin. We will begin developing the necessary 
histochemical reagents and the appropriate substrate/in- so 
soluble product combinations for visualizing in situ hybrid- 
izations without background problems in the near future. 
The histochemical approaches to signal amplification should 
therefore be ready for trial in the summer of 1981. 

Detecting and/or imaging very low levels of fluorescent 55 
light is possible using currently available image intensifiers 
or systems composed of lasers and photomultipliers. These 
methods permit the detection of light down to the level of 
individual photons. With suitable digital processing systems, 
images can be produced in which each point, i.e. each pixel, 60 
of the image is strictly proportional to the number of photons 
emitted by a point at the object Using systems of this kind 
or flow systems in which the cells or parts of cells flow past 
a laser beam, one can btain detection sensitivity increases 
for fluorescent material of factors between 100 and 1000 65 
beyond that which can be detected by the eye. This increase 
is sufficient to detect the fluorescence of single copy genes. 



In a preferred modification, analogs of dUTP and UTP 
that contain a biotin molecule covalently bound to the C-S 
position of the pyrimidine ring through an allylamine linker 
arm have been synthesized. These biotinyl-nucleotides are 
efficient substrates for a variety of DNA and RNA poly- 
merases in vitro. DNA containing low levels of biotin 
substitution (50 molecules or less/kilobase) has denatur- 
ation, reassociatibn and hybridization characteristics which 
are indistinguishable from that of unsubstituted control 
DNA 

Thus, this invention also provides a method of chromo- 
somal karyotyping. In this method, modified polynucle- 
otides are prepared which correspond to known genes and 
include modified nucleotides. These polynucleotides are 
hybridized with chromosomal deoxyribonucleic acid and the 
resulting duplexes contacted with appropriate polypeptides 
under suitable conditions to permit complex formation. The 
polypeptides include detectable moieties so that the location 
of the complexes can be determined and the location of 
specific genes thereby fixed. 

Another embodiment of this invention involves detection 
of poly A-containing sequences using poly U in which some 
of the uracil bases have been modified to contain a probe. 
Yet another embodiment involves cyclic modified nucle- 
otides in which two of 
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and z are reacted to form the cyclic moiety 

Such cyclic modified nucleotides may then be used to 

identify hormone receptor sites on cell surfaces which in 

turn can be used as a method of detecting cancer or tumor 

cells. 

Finally, tumor cells can be diagnosed by preparing poly- 
nucleotides which are modified according to this invention 
and are complementary to the messenger ribonucleic acid 
synthesized from a deoxyribonucleic acid gene sequence 
associated with the production of polypeptides, such as 
a-fetal protein or carcinoembryonic antigen, the presence of 
which is diagnostic for specific tumor cells. Hybridization 
and detection of hybrid duplexes thus would provide a 
method for detecting the tumor cells. 

The examples which follow are set forth to illustrate 
various aspects of the present invention but are not intended 
to limit in any way its scope as more particularly set forth in 
the claims. 

EXAMPLE 1 AND 2 

Synthesis of biotinyl — UTP and biotinyl— dUTP 

a) Preparation of Mercurated Nucleotides 

UTP (570 mg, 1.0 mmole) or dUTP 554 mg, 1.0 mmole) 
was dissolved in 100 ml of 0.1M sodium acetate buffer pH 
6.0, and mercuric acetate (1.59 gm, 5.0 mmoles) added. The 
solution was heated at 50° C. for 4 hours, then cooled on ice. 
Lithium chloride (392 mg, 9.0 mmoles) was added and the 
solution extracted six times with an equal volume of ethyl 
acetate to remove excess HgCl 2 . The efficiency of the 
extraction process was monitored by estimating the mercuric 
ion concentration in the organic layer using 4, 4'-bis(dim- 
ethylamino)-thi benzophenone (A. N. Christoper, Analyst, 
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94, 392 (1969). The extent of nucleotide mercuration, deter- 
mined spectrophotometrically following iodination f an 
aliquot of the aqueous solution as described by Dale et al. 
(R.M.K. Dale, D. C. Ward, D. C Livingston, and E Martin, 
Nucleic Acid Res. 2, 915 [1975]), was routinely between 90 
and 100%. The nucleotide products in the aqueous layer, 
which often became cloudy during the ethyl acetate extrac- 
tion, were precipitated by the addition of three volumes of 
ice-cold ethanol and collected by centrifugatioa The pre- 
cipitate was washed twice with cold absolute ethanol, once 
with ethyl ether, and then air dried. These thus prepared 
mercurated nucleotides were used for the synthesis of the 
allylamine-nucleotides without further purification. 

b) Synthesis of allylamine — dUTP and allylamine — UTP 
The mercurated nucleotides (of step a) were dissolved in 

0.1M sodium acetate buffer at pH 5.0, and adjusted to a 
concentration of 20 mM (200 OD/ml at 267 nm). A fresh 2.0 
M solution of allylamine acetate in aqueous acetic acid was 
prepared by slowly adding li ml of allylamine (133 
mmoles) to 8.5 ml of ice-cold 4M acetic add. Three ml (6.0 
mmoles) of the neutralized allylamine stock was added to 25 
ml (0.5 mmole) of nucleotide solution. One nucleotide 
equivalent of K 2 PdQ 4 , (163 mg, 0.5 mmole), dissolved in 4 
ml of water, was then added to initiate the reaction. Upon 
addition of the palladium salt (Alfa-Ventron) the solution 25 
gradually turned black with metal (Hg and Pd) deposits 
appearing on the walls of the reaction vessel After standing 
at room temperature for 18-24 hours, the reaction mixture 
was passed through a 0.45 mm membrane filter (nalgene) to 
remove most of the remaining metal precipitate. The yellow 
filtrate was diluted five-fold and applied to a 100 ml column 
of DEAB-Sephadex TM A-25 (Pharmacia). After washing 
with one column volume of 0.1M sodium acetate butler at 
pH 5.0, the products were eluted using a one liter linear 
gradient (0.1-0.6M) of either sodium acetate at pH -8-9, or 
triethylammonium bicarbonate (TEAB) at pH 7.5. The 
desired product was in the major UV-absorbing portion 
which eluted between 0.30 and 0.35M salt Spectral analysis 
showed that this peak contained several products, final 
purification was achieved by reverse phase— HPLC chro- 
matography on columns of Partisil— ODS2, using either 
0.5M NH^HzPO^ buffer at pH 3.3 (analytical separations), 
or 0.5M triethylammonium acetate at pH 4.3 (preparative 
separations) as eluents. Hie 5-triphosphates of 5-(3-amino- 
propen-l-yl) uridine (the allylamine adduct to uridine) were 
the last portions to be eluted from the HPLC column and 
they were clearly resolved from three, as yet uncharacter- 
ized, contaminants. These nucleotides were characterized by 
proton NMR elemental analysis [AA-dUTP (C 12 H 16 N 3 0 14 
P 3 Na^lHaO): theory C, 22.91; H, 2.88; N, 6.68; P, 14.77. 
Fbund, C, 23.10; H, 2.85; N, 6.49; P, 14.75. AA-UTP (C 12 
Hi 6 N 3 0 15 P 3 Na^HjO): Theory, C 20.61; H, 3.46; N, 
6.01; P, 13.3. Found C, 20.67; H, 4.11; N, 5.39; P, 13.54] 
spectrally and chromatographically. 

c) Biotination of AA-dUTP or AA-UTP 
Biottoyl-N-hydroxysuccinimide ester (NHSB) was pre- 
pared from biotin (Sigma) as described previously (EL 
Heitzmann and R M. Richards, Proc. Natl. Acad. Sd. USA. 
71, 3537 [1974]). AA-dUTP-HjO (63 mg, 0.1 mmole) or 
AA-UTP.4H2O (70 mg, 0.1 mmole) was dissolved in 20 ml 
of 0.1M sodium borate buffer at pH 8.5, and NHSB (34.1 
mg, 0.1 mmole) dissolved in 2 ml of dimethyl fonnamide, 
was added. Hie reaction mixture was left at room tempera- 
ture for four hours and then loaded directly onto a 30 ml 
column of DEAE-Sephadex TM A-25, preequOibrated with 
0.1M TEAB at pH 7.5. The column was eluted with a 400 
ml linear gradient (0.1-0.9M) of TEAB. Fractions contain- 



ing biotinyl-dUTP or biotinyl-UTP, which eluted between 
0.55 and 0.65M TEAB, were desalted by rotary evaporation 
in the presense of methanol and redissolved in water. Occa- 
sionally a slightly cloudy solution was obtained: this tur- 
bidity, due to a contaminant in some TEAB solutions, was 
removed by filtration through a 0.45 mm filter. For long term 
storage, the nucleotides woe converted to the sodium salt by 
briefly stirring the solution in the presence of Dowex TM 50 
(Na + form). After filtration the nucleotide was precipitated 
by the addition of three volumes of cold ethanol, washed 
with ethyl ether, dried in vacuo over sodium hydroxide 
pellets, and stored in a dessicator at -20° C For immftrfiqtft 
use, the nucleotide solution was made 20 mM in Tris-HCl at 
pH 7.5, and adjusted to a final nucleotide concentration of 5 
mM. Stock solutions were stored frozen at -20° C. 

Elemental analysis of the bio-dUTP and bio-UTP prod- 
ucts yielded the following results. Bio-dUTP (C^ N 5 
0 18 P 3 S! N^.l HaO). Theoretical; C, 29.80; H, 338; N, 
7.89; P, 10.47; S. 3.61. Found; C, 30.14 H, 3.22; N, 7.63; P, 
20 10.31;S,3.70.Bic-UTP(C 22 H 3O N 5 O 19 P 3 S l Na 4 3H 2 O): 
Tlieoretical; Q 29.15; H, 3.19; N, 7.45; P, 9.89; S, 3.41. 
Found; C, 28.76; H, 3.35; N, 7.68; P, 9.81; S, 3.32. 

The spectral properties of bio-dUTP and bio-UTP at pH 
7.5 [X max, 289 nm (e=7,100); X max, 240 nm (e=10,700); 
X min, 262 nm (e=4,300)] reflect the presence of an exocylic 
double-bond in conjugation with the pyrimidine ring. These 
nucleotides also give a strong positive reaction (an orange- 
red color) when treated with p-dimethylainirwcmnamalde- 
hyde in ethanolic sulfuric acid, a procedure used for biotin 
quantitation (D. B. McCormick and J. A. Roth, Anal Bio- 
chem., 34, 326, 1970). However, they no longer react with 
ninhydrin, a characteristic reaction of the AA-dUTP and 
AA-UTP starting materials. 
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EXAMPLES 3 AND 4 

Synthesis of biotinyl-CTP and biotinyl-dCTP 

CTP and dCTP were a) mercurated, b) reacted with 
allylamine, and c) biotinized with NHS-biotin, essentially as 
described in Example 1 . CTP (56.3 mg, 0. 1 mmole) or dCTP 
(59.1 mg, 0.1 mmole) were dissolved in 20 ml of 0.1M 
sodium acetate buffer at pH 5.0, and mercuric acetate (0.159 
gm, 0.5 mmoles) added. The solution was heated at 50° C 
for 4.5 hours then cooled on ice. Lithium chloride (39.2 mg, 
0.9 mmoles) was added and the solution extracted 6 times 
with ethyl acetate. The nucleotide products in the aqueous 
layer were precipitated by the addition of three volumes of 
cold ethanol and the precipitate collected by centrifugatioa 
The precipitate was washed with absolute ethanol, ethyl 
ether, and then air dried. These products were used without 
further purification for the synthesis of AA-CTP and AA- 
dCTP, respectively. Hie mercurated nucleotides were dis- 
solved in 0.1M sodium acetate buffer at pH 5.0 and adjusted 
to a concentration of 1 0 mM (92 OD/ml at 275 nm). 0.6 ml 
(1.2 mmole) of a 2.0M allylamine acetate stock (prepared as 
described in Example 1) was added to 10 ml of nucleotide 
solution (0.1 mmole) followed by the addition of KJ?dCl 4 
(32.6 mg, 0.1 mmole), dissolved in 1.0 ml of H^O. After 
standing at room temperature for 24 hours, the solution was 
filtered through a 0.45 mM membrane t remove metal 
precipates. The filtrate was diluted five-fold and loaded onto 
a 50 ml. column of DEAE-sephadex A-25, preequilibrated 
with 50 mM TEAB at pH 7.5. The nucleotide products were 
fractionated by applicati n of a 500 ml linear gradient 
(0.05-0.6M) of TEAB at pH 7.5. Hie desired product was in 
the major UV absorbing portion which eluted between 0.28 
and 0.38M salt The pooled samples were desalted by rotary 
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evaporation, dissolved in 0.5M triethylammonium acetate at 
pH 4.2, and final purification achieved by HPLC chroma- 
tography on columns of Partisil ODS-2, using 0.5M triethy- 
lammonium acetate as the eluenL Appropriate fractions were 
pooled, lyoprdlized, and the products dissolved in H^O. The 5 
nucleotides were converted to the Na + salt by stirring briefly 
in the presence of Dowex TM 50 (Na + form). After filtration, 
to remove the Dowex resin, the nucleotides were precipi- 
tated by the addition of 3 volumes of cold ethanoi. The 
precipitate was washed with ether and then air dried. Ana- 10 
lyrical results: AA-dCTP (C 12 H l7 N 4 O l3 P 3 Na4.2H20); 
Theory, C, 2129; H, 2.63; N, 8.67, P, 14.40. Found C, 22.16; 
H. 2.89; N. 8.77; P, 14.18. AA-CTP <C n H 17 N 4 0 a4 
Na 4 ^2H 2 0); Theory C, 21.75; H, 2.57; N, 8.46; P, 14.01. 
Found, C, 22.03; H, 247; N, 8.69; P, 13.81; Spectral 15 
properties in 0.1M Borate buffer at pH 8.0, X max 301 nm 
(€=6,400), X min 271 nm (€=3,950) X max 250 nm (c=9, 
700). Both AA-dCTP and AA-CTP give a positive mnhydrin 
test AA-CTP (6.6 mg, 0.01 mmole) or AA-dCTP (6.4 mg, 
0.01 mmole) was dissolved in 5 ml of 0.1M sodium borate 20 
buffer at pH 8.5, and NHS-biotin (3.4 mg, 0.01 mmole), 
dissolved in 0.2 ml of dimethylformamide, was added. After 
sitting at room temperature for 4 hours the sample was 
chromatographed on a 10 ml column of DEAE-Sephadex 
A-25, using a 150 ml linear gradient (0.1-O.9M) of TEAB at 25 
pH 7.5, as eluenL Fractions containing biotinyl-CTP or 
biotinyl-dCTP, which eluted between 0.50 and 0.6M TEAB, 
were pooled, desalted by rotary evaporation, and after being 
adjusted to a final concentration of 5 mM in 0.02M Tris-HQ 
buffer at pH 7.5, were frozen at -20° C. The products give 30 
a strong positive reaction for biotin with p-dimethylamino- 
cinnamldehyde in ethanolic sulfuric acid but give a negative 
test for primary amines when sprayed with ninny drin. Fur- 
ther structural characterization of these products is in 
progress. 35 

EXAMPLES 5 AND 6 

Synthesis of Iminobiotinyl— UTP and Iminobiotinyl-— 
dTJTP 40 

Iminobiotin hydrobromide was prepared from biotin as 
described previously (K. Hofmann, D. B. Melville and V. du 
Vigneaud, J. Biol. Chem, 141, 207-211, 1941; K. Hermann 
and A. E. Axelrod, Ibid., 187, 29-33, 1950). The N-hydrox- 
y suctinimide (NHS) ester of iminobiotin was prepared using 45 
the protocol previously described for the synthesis of NHS- 
Biotin (H. Heitzmann and F. M. Richards, Proc. Nat. Acad. 
Sci. USA, 71, 5537, 1974). AA-UTP (7.0 mg, 0.01 mmole) 
or AA-dUTP (6.3 mg, 0.01 mmole), prepared as detailed in 
example 1 (part b), was dissolved in 5 ml of 0.1M sodium so 
borate buffer at pH 8.5, and NHS-iminobiotin (3.5 mg, 0.01 
mmole), dissolved in 0.5 ml of dimethylformamide, was 
added The reaction mixture was left at room temperature for 
12 hours and then loaded directly onto a 10 ml column of 
DEAE-Sephadex A-25, preequilibrated with 0.05M TEAB 55 
at pH 7.5. The column was eluted with a 150 ml linear 
gradient (0.05-0.6M) of TEAB. Fractions containing imi- 
nobiotin-UTP or iminobiotin-dUTP, which eluted between 
0.35 and 0.40M TEAB, were desalted by rotary evaporation 
in the presence of methanol and dissolved in Rfi. Hie 60 
products contained a small amount of aUylamine-nucleotide 
adduct as an impurity, as judged by a weak positive result in 
the ninhydrin test Final purification was achieved by affinity 
chromatography on avidiri-sepharose. Fractions of the 
impure product, made 0.1M in soldium borate buffer at pH 65 
8 J, were applied to a 5 ml column of avidin-sepharose and 
washed with 25 ml of the same buffer. The column was then 



washed with 50 mM ammonium acetate buffer at pH 4.0, 
which eluted the desired iminobiotin-nucleotide product in a 
sharp peak. The nucleotide was precipitated by the addition 
of 3 volumes of cold ethanoi, washed with ethylether, dried 
in vacuo over sodium hydroxide pellets and stored in a 
dessicator at -20° C. Products were characterized by 
elemental analysis, as well as by spectral and chromoto- 
graphic properties. 

EXAMPLES 7 AND 8 

Synthesis of NAGE— UTP and NAGE — dUTP 

Allyl (3-amino-2-hydroxy,)propyl ether, abbreviated 
NAGE, was prepared from allyl gjycidyl ether (Age) 
(obtained from Aldrich Chemical Co. ). 10 ml of Age (84 
mmole) was added slowly (in a fume hood) to 50 ml of 9M 
ammonium hydroxide and the mixture allowed to stand at 
room temperature for six hours. Excess ammonia was 
removed by rotary evaporation under reduced pressure to 
yield a viscous yellow oil. Analysis of this product by proton 
NMR showed that it possessed the required structure. 5-mer- 
curi-dUTP (0.1 mmole) or 5-mercuri-UTP (0.2 mmole) was 
dissolved m 2-4 ml of 0.2M sodium acetate buffer at pH 5 .0, 
and a 16 fold molar-excess of NAGE adjusted to pH 5.0 with 
acetic acid prior to use, was added. The final reaction 
volumes (43 and 8.4 ml) had nucleotide concentrations of 
43 and 42 mM, respectively. One equivalent of KJPdCL* (0.1 
or 0.2 mmoles) was added to initiate the reaction After 
standing at room temperature for 18 hours, the reaction 
mixtures were filtered through 0.45 urnM membranes the 
samples diluted five-fold, and chromatographed on columns 
of DEAE-Sephadex A-25, using linear gradients (0.1-0.6M) 
of sodium acetate. Fractions containing the desired products, 
as judged by their UV spectra and characteristic HPLC 
elution profiles on Partisil ODS-2, were pooled, diluted, and 
further purified by rechromatography on DEAE-Sephadex 
using shallow gradients (0.1-0 JM) of ammonium bicarbon- 
ate at pH 8.5. Under these conditions the majority of the 
NAGE-dUTP (or NAGE-UTP)could be cleanly separated 
from residual impurities. Proton NMR spectra were obtained 
at this stage of purification after the nucleotides were 
lyophilized and redissolved in D 2 0. For elemental analysis, 
the products were converted to their sodium salt form. 
Typical analytical results: Nage-dUIP (C 13 N 3 O l6 P 3 
Na4.2 HaO), Ineory, C, 24.99; H, 3.63; N, 5.83; P, 12.88. 
Found, C, 25.39; H, 3.71; N, 5.63; P, 12.88 

EXAMPLE 9 

Uses of Labeled DN A Sequences 
I, Karyotyping 

(a) select from a human gene library some 100 to 200 
clones. Label them as described above, and for each 
clone locate its place or places of hybridization visually 
or with alow-light-level video system. For those clones 
which correspond to a unique sequence gene this 
detennines the location of the cloned DNA on a par- 
ticular human chromosome. Obtain several clones for 
each chromosome. Each of these labeled clones can be 
used to identify particular chromosomes. They can also 
be used in combination to identify each of the 46 
chromosomes as being one of the 22 autosomal pairs or 
the X or the Y. By allowing one set of labeled clones to 
hybridize to the chromosomes and then adding a fluo- 
rescent stain to the label, the set of clones and their 
locations can be visualized and will fluoresce with a 
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particular color. A second set of labeled clones could 
then be used and reacted with a second fluorescent dye. 
Hie same process can be repeated a number of times. 
Thus one can, if desired, have several sets of fluores- 
cent labels attached to the cellular DNA at different but 
specific locations on each of the chromosomes. These 
labels could be used for visual or computerized auto- 
matic karyotyping, 
(b) For automatic karyotyping, one could use one set of 
clones to identify the approximate location of each of 
the 46 chromosomes by finding sets of spots corre- 
sponding to the number of labeling sites on each 
chromosome. Thus, it is possible by computer analysis 
of the digitized images to determine if the chromo- 
somes are suitably spread for further analysis. If they 
are suitably spread then one can use computer analysis 
to identify each, of the individual chromosomes by the 
location and distribution of the labelled spots on each 
one. 

By using the fact that the fluorescent spots can be placed 
at specific locations on each chromosome, one can carry out 
either manual or automatic karyotyping very much more 
effectively than without such labels. 
II. Diagnosis of Genetic Disorders 

By selecting the clones which bind specifically to a 25 
particular chromosome, such as number 23, it is possible to 
count the number of copies of the particular chromosome in 
a cell even if the chromosomes are not condensed at 
metaphase. Thus when fetal cells are obtained for prenatal 
diagnosis of trisomy 21, the diagnosis can be done even if 
the chromosomes are not condensed at metaphase. If nec- 
essary, two sets of labels can be used — one which would be 
specific for chromosome 23 and one for some other chro- 
mosome. By measuring in each cell the ratio of the two 
labels, which might be of different colors, it is possible to 
identify the cells which show an abnormal number of 
chromosomes number 23. This procedure could be used 
either on slides with a low-light-level video system or in a 
flow cy tometer system using laser excitation. It can be used 
to determine any abnormal chromosome number. 
HL Microorganism Detection and Identification 

The labeling of specific sequences of DNA as described 
above permits identification and counting of individual 
bacteria. In order to identify the individual bacteria to which 
a particular fragment of DNA hybridizes the sensitivity must 
Le such that a single labelled structure can be detected. This 
can be done using a low-light-level video system and 
computer summation of images, or by using some other 
device for intensifying the light image. A flow system can 
also be used if the sensitivity can be made sufficiently grand. 
If one immobilized the bacteria on a slide their location 
could be found and the number of such fluorescent spots 
counted. This would provide a count of all of those bacteria 
which contain DNA which can hybridize with the specific 
cl ne utilized. Ifthe clone is selected as being specific for a 55 
particular strain or bacteria, then one can count the number 
of organisms of that strain. In addition,any antibiotic resis- 
tance for which a particular gene has been identified could 
be characterized in a similar way using, as a probe, the DNA 
sequence which is contained in the antibiotic resistance 
gene. In additions probe could be used which is specific for 
a resistance plasmid containing one or more antibiotic 
resistance genes. In addition to individual bacteria, groups of 
bacterial cells of a particular strain can be detected and their 
number estimated if they are located in a small spot so that 
the total fluorescence specific to the hybridized DNA in the 
spot can be measured. In this way the number of organisms 



containing a specific DNA sequence can be measured in a 
mixture of bacteria. 
We claim: 

1. A compound useful as a probe for detecting the 
presence or absence of a nucleic acid, said compound having 
the structure: 
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wherein B represents a purine, 7-deazapurine, or pyrimi- 
dine moiety suitable for incorporation into a polynucle- 
otide and covalently bonded to the exposition of the 
sugar moiety, provided that when B is a purine or 
7-deazapurine, the sugar moiety is attached at the 
^-position of the purine or deazapurine, and when B 
is pyrimidine, the sugar moiety is attached at the N 1 
position of the pyrimidine; 

wherein A represents at least three carbon atoms and an 
indicator molecule selected from the group consisting 
of fluorescent dyes, electron-dense reagents, enzymes 
which can be reacted with a substrate to produce a 
visually detectable reaction product, and radioisotopes; 

wherein B and A are covalently attached directly or 
through a linkage group, said linkage group not inter- 
fering substantially with detection of A; 

wherein if B is a purine, A is attached to the 8-position of 
the purine, if B is a 7-deazapurine, A is attached to the 
7-position of the deazapurine, and if B is a pyrimidine, 
A is attached to the 5-position of the pyrimidine; and 

wherein each of x, y and z represents: 



o 

II 

H-,HO^,HO— P— O- 
OH 



O O 
II II 
HO-P-O— P-0-, or 

OH OH 



0 O O 
II II tl 

HO-P— O— P— O-P-O— . 

1 I I 
OH OH OH 



2. A compound useful as a probe for detecting the 
presence or absence of a nucleic acid, said compound 
containing a nucleotide having the structure: 



x-CH 2 0 B — A 
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wherein B represents a purine, 7-deazapurine, or pyrimi- 
dine moiety covalently bonded to the exposition of 
the sugar moiety, provided that whenever B is purine or 
7-deazapurine, the sugar moiety is attached at the 
N'-position of the purine or deazapurine, and whenever 
B is a pyrimidine, the sugar moiety is attached at the 
N'-position of the pyrimidine; 

wherein A represents at least three carbon atoms and an 
indicator molecule selected from the group consisting 
of fluorescent dyes, electron-dense reagents, enzymes 
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which can be reacted with a substrate to produce a 
visually detectable reaction product and radioisotopes; 

wherein B and A are covalently attached directly or 
through a linkage group, said linkage group not inter- 
fering substantially with detection of A; 

wherein if B is a purine, A or the linkage group is attached 
to the 8-position of the purine, if B is a 7-deazapurine, 
A or the linkage group is attached to the 7-position of 
the deazapurine, and if B is a pyrimidine, A or the 
linkage group is attached to the 5-position of the 
pyrimidine; 

wherein one of x and y represents 
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and the other of x and y is absent or represents — OH or — H; 
and 

wherein z represents H — or HO — . 

3. A complex useful as a probe for detecting the presence 25 
or absence of a nucleic acid, said complex comprising a 
detectable polypeptide complexed with a compound having 
the structure: 
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4. A complex in accordance with claim 3 wherein said 
detectable polypeptide is linked to an indicator molecule 
selected from the group consisting of fluorescent dyes, 
electron-dense reagents, enzymes which can be reacted with 
a substrate to produce a visually detectable reaction product, 
and radioisotopes. 

5. A complex in accordance with claim 4 wherein said 
detectable polypeptide is a fluorescent dye, electron dense 
reagent, or enzyme which can be reacted with a substrate to 
produce a visually detectable reaction product 

6. A complex useful as a probe for detecting the presence 
or absence of a nucleic acid, said complex comprising a 
detectable polypeptide complexed with an oligo- or poly- 
nucleotide containing a nucleotide having the structure: 



x-CH 2 n b — A 
o 



y 2 




x-CH2q b — A 



y * 
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wherein B represents a purine, 7-deazapurine, or pyrimi- 
dine moiety suitable for incorporation into a polynucle- 
otide and covalently bonded to the exposition of the 
sugar moiety, provided that when B is a purine or 40 
7-deazapurine, the sugar moiety is attached at the 
^-position of the purine or deazapurine, and when B 
is pyrimidine, the sugar moiety is attached at the N 1 
position of the pyrimidine; 

wherein A represents at least three carbon atoms, is 
capable of specifically complexing with the detectable 
polypeptide when A is linked to B, and represents a 
component of a signalling moiety capable of producing 
a detectable signal; 

wherein B and A are covalently attached directly or 
through a linkage group, said linkage group not inter- 
fering substantially with the characteristic ability of A 
to form said complex with the detectable polypeptide; 

wherein if B is a purine, A is attached to the 8-position of 
the purine, if B is a 7-deazapurine, A is attached to the 
7-position of the deazapurine, and if B is a pyrimidine, 
A is attached to the 5-position of the pyrimidine; and 

wherein each of x, y and z represents: 60 
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wherein B represents a purine, 7-deazapurine, or pyrimi- 
dine moiety covalently bonded to the (^'-position of 
the sugar moiety, provided that whenever B is purine or 
7-deazapurine, the sugar moiety is attached at the 
Imposition of the purine or deazapurine, and whenever 
B is a pyrimidine, the sugar moiety is attached at the 
^-position of the pyrimidine; 

wherein A represents at least three carbon atoms, is 
capable of specifically complexing with the detectable 
polypeptide when A is linked to B, and represents a 
component of a signalling moiety capable of producing 
a detectable signal; 

wherein B and A are covalently attached directly or 
through a linkage group, said linkage group not inter- 
fering substantially with the characteristic ability of A 
to form said complex with the detectable polypeptide; 

wherein if B is a purine, A or the linkage group is attached 
to the 8-position of the purine, if B is a 7-deazapurine, 
A or the linkage group is attached to the 7-position of 
the deazapurine, and if B is a pyrimidine, A or the 
linkage group is attached to the 5-position of the 
pyrimidine; 

wherein one of x and y represents 



0 o 
II II 

-0-P— 0-or -O-P-O- 

1 I 
OH 0" 



and the other of x and y is absent or represents —OH or — H; 



0 00 

II II II 

H— , HO— ( HO-P-0—, HO-P-0— P— O— , or 
OH OH OH 



wherein z represents H — or HO—. 
7. A complex in accordance with claim 6 wherein said 
65 detectable polypeptide is linked to an indicator molecule 
selected from the group consisting of fluorescent dyes, 
electron-dense reagents, enzymes which can be reacted with 
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a substrate to produce a visually detectable reaction product, 
and radioisotopes. 

8. A complex in accordance with claim 7 wherein said 
detectable polypeptide is a fluorescent dye, electron dense 
reagent, or enzyme which can be reacted with a substrate to 5 
produce a visually detectable reaction product 

9. A complex in accordance with claims 3 or 6 wherein A 
is a ligand. 

10. A complex according to claim 9 wherein said ligand 

is selected from the group consisting of biotin, inunobiotin, 10 
or a cof actor. 

11. A complex in accordance with claim 9 wherein said 
ligand is selected from the group consisting of antigens, 
antibodies and haptens. 

12. A complex in accordance with claim 11 wherein said is 
ligand is dinitrophenol. 

13. A complex according to claims 3 or 6 wherein the 
detectable polypeptide is indirectly detectable by specifi- 
cally complexing the detectable polypeptide with a second 
moiety covalently linked to an indicator molecule selected 20 
from the group consisting of fluorescent dyes, electron- 
dense reagents, enzymes which can be reacted with a 
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substrate to produce a visually detectable reaction product, 
and radioisotopes. 

14. A complex according to claim 13 wherein the detect- 
able polypeptide is selected from the group consisting of 
avidin and streptavidin and the second moiety is selected 
from the group consisting of biotin and imin biotirL 

15. A complex according to claim 14 wherein the indi- 
cator molecule is an enzyme which can be reacted with a 
substrate to produce a visually detectable reaction product 

16. The complex of claims 3 or 6 wherein said linkage 
group is comprises a — CHj— NH — , 

17. The complex of claims 3 or 6 wherein said linkage 
group is selected from the group consisting of — CH=CH— 
CHj — NH — and -^==CH--CH 2 -0— CHa— 
CH(OH) — CHj — NH — . 

18. The complex of claims 3 or 6 wherein A is an 
allylamine group linked directly to B. 

19. The complex of claims 3 or 6 wherein A is a moiety 
comprising an olefinic bond. 



